Talend Big Data – Machine learning

SubscriptionThis content is available for Talend Academy subscription users.Instructor-ledThis content is available as instructor-led training. Open learning plan - EN

 

Talend provides a development environment that lets you interact with many source and target big data stores, without having to learn and write complicated code.

 

This course covers the implementation of machine learning algorithms in Big Data Batch Jobs using the Spark framework.

 

Duration: 1 day (7 hours)

 

Target audience: Anyone who wants to use Talend Studio to industrialize machine learning algorithms

 

Prerequisites: Completion of Talend Data Quality Essentials or Talend Big Data Basics

 

Learning objectives: After completing this learning plan, you will be able to:

  • Connect to a Hadoop cluster from a Talend Job

  • Use context variables and metadata

  • Read and write files in HDFS in a Big Data Batch Job

  • Configure a Big Data Batch Job to use the Spark framework

  • Create and test recommendation models

  • Create and test classification models

  • Use a machine learning algorithm to deduplicate data

 

Training modules: To complete the learning plan, take the following training modules: